Goto

Collaborating Authors

 Bandung


Data-Driven Global Sensitivity Analysis for Engineering Design Based on Individual Conditional Expectations

Palar, Pramudita Satria, Saves, Paul, Regis, Rommel G., Shimoyama, Koji, Obayashi, Shigeru, Verstaevel, Nicolas, Morlier, Joseph

arXiv.org Machine Learning

Explainable machine learning techniques have gained increasing attention in engineering applications, especially in aerospace design and analysis, where understanding how input variables influence data-driven models is essential. Partial Dependence Plots (PDPs) are widely used for interpreting black-box models by showing the average effect of an input variable on the prediction. However, their global sensitivity metric can be misleading when strong interactions are present, as averaging tends to obscure interaction effects. To address this limitation, we propose a global sensitivity metric based on Individual Conditional Expectation (ICE) curves. The method computes the expected feature importance across ICE curves, along with their standard deviation, to more effectively capture the influence of interactions. We provide a mathematical proof demonstrating that the PDP-based sensitivity is a lower bound of the proposed ICE-based metric under truncated orthogonal polynomial expansion. In addition, we introduce an ICE-based correlation value to quantify how interactions modify the relationship between inputs and the output. Comparative evaluations were performed on three cases: a 5-variable analytical function, a 5-variable wind-turbine fatigue problem, and a 9-variable airfoil aerodynamics case, where ICE-based sensitivity was benchmarked against PDP, SHapley Additive exPlanations (SHAP), and Sobol' indices. The results show that ICE-based feature importance provides richer insights than the traditional PDP-based approach, while visual interpretations from PDP, ICE, and SHAP complement one another by offering multiple perspectives.


Coverage-Recon: Coordinated Multi-Drone Image Sampling with Online Map Feedback

Hanif, Muhammad, Terunuma, Reiji, Sumino, Takumi, Cheng, Kelvin, Hatanaka, Takeshi

arXiv.org Artificial Intelligence

Achieving high-quality reconstruction requires capturing images of keypoints within the target scene from diverse viewing angles, and coverage control offers an effective framework to meet this requirement. Meanwhile, recent advances in real-time 3D reconstruction algorithms make it possible to render an evolving map during flight, enabling immediate feedback to guide drone motion. Building on this, we present Coverage-Recon, a novel coordinated image sampling algorithm that integrates online map feedback to improve reconstruction quality on-the-fly. In Coverage-Recon, the coordinated motion of drones is governed by a Quadratic Programming (QP)-based angle-aware coverage controller, which ensures multi-viewpoint image capture while enforcing safety constraints. The captured images are processed in real time by the NeuralRecon algorithm to generate an evolving 3D mesh. Mesh changes across the scene are interpreted as indicators of reconstruction uncertainty and serve as feedback to update the importance index of the coverage control as the map evolves. The effectiveness of Coverage-Recon is validated through simulation and experiments, demonstrating both qualitatively and quantitatively that incorporating online map feedback yields more complete and accurate 3D reconstructions than conventional methods.


BenchRL-QAS: Benchmarking reinforcement learning algorithms for quantum architecture search

Ikhtiarudin, Azhar, Das, Aditi, Thakkar, Param, Kundu, Akash

arXiv.org Artificial Intelligence

Our study systematically evaluates 9 different RL agents, including both value-based and policy-gradient methods, on quantum problems such as variational eigensolver, quantum state diagonalization, vari-ational quantum classification (VQC), and state preparation, under both noiseless and noisy execution settings. To ensure fair comparison, we propose a weighted ranking metric that integrates accuracy, circuit depth, gate count, and training time. Results demonstrate that no single RL method dominates universally, the performance dependents on task type, qubit count, and noise conditions providing strong evidence of no free lunch principle in RL-QAS. As a byproduct we observe that a carefully chosen RL algorithm in RL-based VQC outperforms baseline VQCs. BenchRL-QAS establishes the most extensive benchmark for RL-based QAS to date, codes and experimental made publicly available for reproducibility and future advances.


Mobile Robot Localization via Indoor Positioning System and Odometry Fusion

Nugraha, Muhammad Hafil, Abdul, Fauzi, Bramantyo, Lastiko, Rijanto, Estiko, Saputra, Roni Permana, Mahendra, Oka

arXiv.org Artificial Intelligence

Muhammad Hafil Nugraha Research Centre for Smart Mechatronics National Research and Innovation Agency Bandung, Indonesia muha167@brin.go.id Estiko Rijanto Research Centre for Smart Mechatronics National Research and Innovation Agency Bandung, Indonesia estiko.rijanto@brin.go.id Oka Mahendra Research Centre for Smart Mechatronics National Research and Innovation Agency Bandung, Indonesia oka.mahendra@brin.go.id Abstract -- Accurate localization is crucial for effectively operating mobile robots in indoor environments. This paper presents a comprehensive approach to mobile robot localization by integrating an ultrasound - based indoor positioning system (IPS) with wheel odometry data via sensor fusion techniques. The Extended Kalman Filter (EKF) fusion method combines the data from the IPS sensors and the robot's wheel odometry, providing a robust and relia ble localization solution. Extensive experiments in a controlled indoor environment reveal that the fusion - based localization system significantly enhances accuracy and precision compared to standalone systems.


Persistent Homology of Topic Networks for the Prediction of Reader Curiosity

Hopp, Manuel D. S., Labatut, Vincent, Amalvy, Arthur, Dufour, Richard, Stone, Hannah, Jach, Hayley, Murayama, Kou

arXiv.org Artificial Intelligence

Reader curiosity, the drive to seek information, is crucial for textual engagement, yet remains relatively underexplored in NLP. Building on Loewenstein's Information Gap Theory, we introduce a framework that models reader curiosity by quantifying semantic information gaps within a text's semantic structure. Our approach leverages BERTopic-inspired topic modeling and persistent homology to analyze the evolving topology (connected components, cycles, voids) of a dynamic semantic network derived from text segments, treating these features as proxies for information gaps. To empirically evaluate this pipeline, we collect reader curiosity ratings from participants (n = 49) as they read S. Collins's ''The Hunger Games'' novel. We then use the topological features from our pipeline as independent variables to predict these ratings, and experimentally show that they significantly improve curiosity prediction compared to a baseline model (73% vs. 30% explained deviance), validating our approach. This pipeline offers a new computational method for analyzing text structure and its relation to reader engagement.


VisuCraft: Enhancing Large Vision-Language Models for Complex Visual-Guided Creative Content Generation via Structured Information Extraction

Jiang, Rongxin, Long, Robert, Gu, Chenghao, Yan, Mingrui

arXiv.org Artificial Intelligence

This paper introduces VisuCraft, a novel framework designed to significantly enhance the capabilities of Large Vision-Language Models (LVLMs) in complex visual-guided creative content generation. Existing LVLMs often exhibit limitations in maintaining high visual fidelity, genuine creativity, and precise adherence to nuanced user instructions when generating long-form texts. VisuCraft addresses these challenges by integrating a multimodal structured information extractor (E) and a dynamic prompt generation module (G). The extractor distills fine-grained visual attributes from input images into a rich, structured representation, which the dynamic prompt module then combines with user instructions to create highly optimized prompts for underlying LVLMs (e.g., LLaVA, InstructBLIP). Evaluated on the self-constructed ImageStoryGen-500K dataset using VisuGen Metrics (Visual Grounding, Creativity, and Instruction Adherence), VisuCraft consistently outperforms baseline LVLMs across tasks like story generation and poetry composition. Our results demonstrate remarkable improvements, particularly in creativity and instruction adherence, validating VisuCraft's effectiveness in producing imaginative, visually grounded, and user-aligned long-form creative text. This work unlocks new potential for LVLMs in sophisticated creative AI applications.


Survey of Swarm Intelligence Approaches to Search Documents Based On Semantic Similarity

Muniyappa, Chandrashekar, Kim, Eunjin

arXiv.org Artificial Intelligence

Swarm Intelligence (SI) is gaining a lot of popularity in artificial intelligence, where the natural behavior of animals and insects is observed and translated into computer algorithms called swarm computing to solve real-world problems. Due to their effectiveness, they are applied in solving various computer optimization problems. This survey will review all the latest developments in Searching for documents based on semantic similarity using Swarm Intelligence algorithms and recommend future research directions.


AI-Based Demand Forecasting and Load Balancing for Optimising Energy use in Healthcare Systems: A real case study

Rahimi, Iman, Patel, Isha

arXiv.org Artificial Intelligence

- This paper addresses the critical need for efficient energy management in healthcare facilities, where fluctuating energy demands challenge both operational and sustainability goals. Traditional energy management methods often fall short in healthcare settings, lead ing to inefficiencies and increased costs. To address this, the paper explores AI - driven approaches for demand forecasting and load balancing, introducing a novel integration of LSTM (Long Short - Term Memory), g enetic a lgorithm, and SHAP (Shapley Additive E xplanations) specifically tailored for healthcare energy management. While LSTM has been widely used for time - series forecasting, its application in healthcare energy demand prediction is underexplored. Here, LSTM is demonstrated to significantly outperfor m ARIMA and Prophet models in handling complex, non - linear demand patterns. Results show that LSTM achieved a Mean Absolute Error (MAE) of 21.69 and Root Mean Square Error (RMSE) of 29.96, significantly improving upon Prophet (MAE: 59.78, RMSE: 81.22) and ARIMA (MAE: 87.73, RMSE: 125.22), highlighting its superior forecasting capability. Genetic algorithm is employed not only for optimising forecasting model parameters but also for dynamically improving load balancing strategies, ensuring adaptability to real - time energy fluctuations. Additionally, SHAP analysis is used to interpret the models and understan d the impact of various input features on predictions, enhancing model transparency and trustworthiness in energy decision - making. The combined LSTM - GA - SH AP approach offers a comprehensive framework that improves forecasting accuracy, enhances energy efficiency, and supports sustainability in healthcare environments. Future work could focus on real - time implementation and further hybridisation with reinforc ement learning for continuous optimisation. This study establishes a strong foundation for leveraging AI in healthcare energy management, showcasing its potential for scalability, efficiency, and resilience. Introduction Australia has a big capacity of using renewable energy in different regions ( Holloway, R, 2023; Rahimi et al., 2025) . Australian healthcare system plays a major role in using renewable energies. Optimising energy use in healthcare systems is essential due to the high and often unpredictable energy demands needed to run medical equipment, keep environmental conditions stable, and support constant patient care.


Natural language processing for African languages

Adelani, David Ifeoluwa

arXiv.org Artificial Intelligence

Recent advances in word embeddings and language models use large-scale, unlabelled data and self-supervised learning to boost NLP performance. Multilingual models, often trained on web-sourced data like Wikipedia, face challenges: few low-resource languages are included, their data is often noisy, and lack of labeled datasets makes it hard to evaluate performance outside high-resource languages like English. In this dissertation, we focus on languages spoken in Sub-Saharan Africa where all the indigenous languages in this region can be regarded as low-resourced in terms of the availability of labelled data for NLP tasks and unlabelled data found on the web. We analyse the noise in the publicly available corpora, and curate a high-quality corpus, demonstrating that the quality of semantic representations learned in word embeddings does not only depend on the amount of data but on the quality of pre-training data. We demonstrate empirically the limitations of word embeddings, and the opportunities the multilingual pre-trained language model (PLM) offers especially for languages unseen during pre-training and low-resource scenarios. We further study how to adapt and specialize multilingual PLMs to unseen African languages using a small amount of monolingual texts. To address the under-representation of the African languages in NLP research, we developed large scale human-annotated labelled datasets for 21 African languages in two impactful NLP tasks: named entity recognition and machine translation. We conduct an extensive empirical evaluation using state-of-the-art methods across supervised, weakly-supervised, and transfer learning settings.


'One day I overheard my boss saying: just put it in ChatGPT': the workers who lost their jobs to AI

The Guardian

I've been a freelance journalist for 10 years, usually writing for magazines and websites about cinema. I presented a morning show on Radio Kraków twice a week for about two years. It was only one part of my work, but I really enjoyed it. It was about culture and cinema, and featured a range of people, from artists to activists. I remember interviewing Ukrainians about the Russian invasion for the first programme I presented, back in 2022. I was let go in August 2024, alongside a dozen co-workers who were also part-time. We were told the radio station was having financial problems.